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ABSTRACT 

High precision cosmological distance measurements towards individual objects 
such as time delay gravitational lenses or type la supernovae are affected by weak 
lensing perturbations by galaxies and groups along the line of sight. In time delay 
gravitational lenses, "external convergence," Koxt, can dominate the uncertainty in 
the inferred distances and hence cosmological parameters. In this paper we attempt 
to reconstruct Koxt, due to line of sight structure, using a simple halo model. We 
use mock catalogues from the Millennium Simulation, and calibrate and compare our 
reconstructed P(Kext) to ray-traced Kgxt "truth" values; taking into account realistic 
uncertainties on redshift and stellar masses. We find that the reconstruction of Atoxt 
provides an improvement in precision of ~50% over galaxy number counts. We find 
that the lowest-Kext lines of sight have the best constrained P(«;ext)- In anticipation 
of future samples with thousands of lenses, we find that selecting the third of the 
systems with the highest precision Kext estimates gives a sub-sample of unbiased 
time delay distance measurements with (on average) just 1% uncertainty due to 
line of sight external convergence effects. Photometric data alone are sufficient to 
prc-select the best-constrained lines of sight, and can be done before investment in 
light-curve monitoring. Conversely, we show that selecting lines of sight with high 
external shear could, with the reconstruction model presented here, induce biases of 
up to 1% in time delay distance. We find that a major potential source of systematic 
error is uncertainty in the high mass end of the stellar mass-halo mass relation; 
this could introduce ^2% biases on the time-delay distance if completely ignored. 
We suggest areas for the improvement of this general analysis framework (including 
more sophisticated treatment of high mass structures) that should allow yet more 
accurate cosmological inferences to be made. 

Key words: gravitational lensing ~ methods: statistical - galaxies: halos - galaxies: 
mass function - cosmology: observations 



1 INTRODUCTION 

Every distant object we observe has had its apparent shape 
distorted, and size and total brightness magnified (or de- 
magnified) by a compound weak gravitational lens con- 
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str ucted from all the m ass d istributed between u s and it. 
As IVale fc Whi"t3 (|2003l ) and iHilbert et al] (|2007l l showed, 
there are no empty lines of sight through our universe. 
This fact makes gravitational lensing a potentially impor- 
tant source of systematic error for any estimate of lumi- 
nosity (or distance); this issue ha s been raised for e.g. 



type l a supernovae bv lHolz fc midi (119981): IHoIz fc Lindej 



20051 ) , for gamma ray bursts by Oeuri fc Takahashil 1 200l 
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Wang fc Pail |201lh . and for high redshift galaxies by 



Wvithe et all (|201lh . among others. 



Along lines of sight containing strong gravitational 
lenses, the perturbative effects of line of sight mass struc- 
ture have been found to be particularly important, with 
foreground and background structures having a signif- 
icant effect on the inferred lensing cross-section (e.g . 
IWonget aO 120121') a r id dis tance ratios (|Dalal et al.ll2005l ). 
Indeed, ISuvu et al found that in time delay lens 

cosmography the so-called "external convergence" Kcxt due 
to mass structures along the (unusually over-dense) line 
of sight to the quadruply-imaged radio source B1608-f656 
had to be included in their analysis, and was the dominant 
source of uncertainty in their 5% measurement of the time 
delay distance. Large samples of time-delay lenses are ex- 
pected to be discovered i n the next decade in grou nd-based 
optical imaging surveys (|Oguri fc Marshall|[2O10l ): the ex- 
ternal convergence will have to be understood increasingly 
well in order to prevent its correction from dominating the 
systematic error budget. 

While it is rare for three galaxies to line up well enough 
for both of the background sources to be strongly lensed 
(|Gavazzi et all l2008l : [CoUett et al] 120121 '). the large size of 
dark matter halos makes partial alignments - such that 
they act as perturbing weak lenses - a near certainty. 
The large scale of the perturbers means the external lens- 
ing perturbations can be approximated by a quadrupole 
lens characterised only by external convergence and shear. 
The external shear, 7ext can potentially be recovered in 
the mass-modelling of the lens, but the convergence is un- 
detectable from the image positions, shapes and relative 
flux es; this is the well-known mass-sh eet degeneracy (see 
e.g. iFalco. Gorenstein. fc Shapiroll 19851 . for details). 

Since the mass-sheet degeneracy prevents any estimate 
of Kcxt from the strong lens modelling, additional informa- 
tion is required. Weak lensing measurements can be used 
for r ich lines of sight (|Nakaiima et all l2009l : | Fadelv et al.l 
I2OIOI I. but for lines of sight with low galaxy density we 
must attempt to reconstruct the distribution of mass along 
the line of sight. Attempts to reconstruct the mass distri- 
bution in strong lens fields have focused on understanding 
the external shear which seems to be lar ge i n most strong 
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iFassnaciit c t al. ( 20061 ) all found groups of galaxies hosting 
or near to known gravitational lenses. Is generally found 
that lens galaxies, reside in over-dense environments, in- 
distinguishable from those occupied by similarly massive 
galaxies (|Augeij|200i iTreu et al.lboogl 'l 'Wong ct al.' ('201l|) 
esti mated Pr(7ext) given th e data of [Williams ct al. (2006) 
and iMomcheva et al] (|2006l ) but found significant discrep- 
ancy between the predicted shear distribution a nd the exter- 
nal shear demanded by the strong lens model. IWong et al.l 
also found that both line of sight structures and the group 
of galaxies in each lens plane contribute significant propor- 
tions of the shear. 

For lines of sight without strong lenses, magnifi- 
cation is the mo r e rele vant quantity than convergence; 
iGunnarsson et al] (|2006h used a galaxy halo model com- 
bined with simple scaling relations to reconstruct super- 
novae lines of sight and found that the dispersion in ap- 
parent source brightness due to lensing magnificatio n could 
be red uced by a factor of two. Converselv Karpcn ka et al.l 
(I2OI2D used the brightness dispersion in type la supernovae 



to infer the parameters of both their galaxy halo m odel 
and mass-t o-light scaling relation. Both lKarpenka et al.l and 
|jonsson~et al. (2010,) were able to detect lensing effects in 
the supernova data this way. 

Large numerical simulations can also be used to esti- 
mate globa l convergenc e distr ibutions. iHolder fc Schechterl 
|2003D and iDalal et al.1 (|2005l l car ried out ray-tracing cal- 
culations in N-body sim ulations (|Kauffmann et ahl Il999l : 
IWambsganss et al.ll2004) to estim a te the distribution of ex- 
ternal shear values HHilbert et al.l (|2009l ) performed similar 
ray-tracing experime nts through the Millennium Simulation 
(|Springel et al.l [20051 ). gene rating a predicted K ext at every 
position in a simulated sky. iHilbert et al.l (|2009l ) found that 
after removing the matter on the strong lensing plane Mil- 
lennium Simulation lines of sight with strong lenses were 
not biased towards high Koxt , although selection functions 
for discovering samples of strong lenses were not taken into 
account. 

The Millennium Simulation resul ts have bee n used 
to analyse real observations as well. ISuvu et al.1 (|2010l ) 
selected Millennium Simulation lines of sight by their 
apparent galaxy over-density, in a 45 arcsec radius 
aperture down to i < 24.5, to match the observed 
over-density towards the time dela y lens B1608-f656 
(jFassnacht. Koopmans. fc Wondlionh . The resulting dis- 
tribution of Koxt values from the ray-tracing was taken to 
be an estimate of Pr(Kcxt), which was then marginalised 
over when inferring the time delay d istance in t h is sys - 
tem. A similar procedure was use d in Suvu et al.l (|2012| ). 
In a companion paper to this one, iGreene et al.l (|2013l ) in- 
vestigate improvements to this method by weighting the 
galaxy counts by observables including galaxy luminosity 
and perpendicular distance to the line of sight, again using 
the Millennium Simulation mock catalogues and their as- 
sociated Kext values to construct Pr(Kext|I'). For the most 
over-dense lines of sight their results show a ~20% improve- 
ment over number counts alone, but little improvement for 
less dense ones. 

In this paper ^ we combine the halo m o del reconstruc- 
tion approach of Gunnarsson et al.l (|2006l ). Ijonsson et al.l 
(|2010l ). IWong et all (|201ll ) a nd others, w i th the idea of cal- 
ibrating to simu lations from Suvu et aP (|2010l ) Taking the 
ACext values from lHilbert et al.l (|20od )'s rav-tracing calcula- 
tions as the "truth" that we have to recover, we use the Mil- 
lennium Simulation mock galaxy catalogues first to calibrate 
the reconstruction and account for unseen mass and voids, 
and then to test the accuracy of the line of sight mass recon- 
struction under this assumption. Probabilistically assigning 
mass and redshift to every observed galaxy in a given ob- 
served field, we generate Monte Carlo sample line of sight 
mass distributions, and so construct the probability distri- 
bution function (PDF) Pr(Kcxt|0) for that field. We then 
emulate the combination of many such PDFs to quantify 
the residual bias that would be translated to the global (in- 
cluding cosmological) parameters, if left unaccounted for. 
In doing so, we aim to answer the following questions: 

• Faint galaxies, filaments and dark structures will not 
appear in any photometric object catalogue, but they will 
contribute convergence at some level. How much of the total 
line of sight convergence comes from visible galaxies, and 
how much effect do dark structures and voids have? 

• Can the true line of sight convergence be recovered 
from a calibrated halo model reconstruction? What scatter 
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redshift z 



Figure 1. Four different views of a slightly over-dense Millennium Simulation line of sight, with a Kext of 0.03. Top row, left: The 
positions of galaxies projected on the sky. The area of the circles is proportional to observed j-band flux; the brightest object shown 
has an i magnitude of 17.7. Top row, centre: The angular sizes of halos projected on the sky. Red and blue regions lie within the 
NFW scale radius and virial radius of each halo, respectively. There are essen tially no empty lightcones. Top r ow, right: The ind ividual 
^ext contributions of each halo, assuming a iBaltz. Marshall, fc Qguril hOO^ truncated NFW profile and the lNeto et al ] ll2007l l mass- 
concentration relation. Comparison of this panel and the centre panel illustrates the relative importance of proximity to the line of 
sight. Bottom: A view along the redshift axis, showing only halos with \x\ < 0.3 arcmin. The area of the points is proportional to each 
halo's mass: the most massive halo shown has 1.6 X IO^^Mq. The optical axis is shown by the dashed line, while the dotted lines mark 
the lens and source pianos for a B1608-likc strong lens. 



and residual bias are induced by the reconstruction process, 
and how might these be reduced in the future? 

• Both the detectability of lenses, and the selection of 
sub-samples of them for further observation, could poten- 
tially induce a selection bias in external convergence. How 
should we select objects to achieve the highest accuracy 
convergence reconstruction? Can such a selection be made 
robustly, using the reconstruction results? 

This paper is organized as follows. We review the rel- 
evant gravitational lensing theory in Section [21 and the 
Millennium Simulation ray-traced convergences and mock 
catalogues in Section [S] before introducing our simple re- 
construction model in Section 2] We then test this model in 
two phases: first, in Section^ with known redshift and halo 
mass for every galaxy in a lens field, in order to quantify the 
irreducible uncertainty due to unseen mass, and second, in 
Section [51 with realistic observational uncertainties on the 
observed galaxies' stellar masses and redshifts. In Section[7] 
we investigate the potential systematic error induced by se- 
lecting a subset of lines of sight. In Section[Hlwe investigate 
systematic errors caused by assuming an incorrect stellar- 
mass relation. We discuss our results in Section [5] before 
concluding in Section [TIJl 

Throughout this paper magnitudes are given in the AB 
system and we adopt the Millennium Simulation's "concor- 
dance" parameters for our reference cosmology, i.e. h = 
0.73, Q.m = 0.25 and SIa = 0.75, where the symbols indi- 
cate the Hubble Constant in units of 100 km s~^ Mpc~^ 
and the matter and dark energy density of the Universe in 
units of the critical density. 



2 THEORETICAL BACKGROUND 

In strong lens systems where the source is time variable, 
the images do not vary simultaneously; the optical path 
length for each image is different due to relativistic and 
geometric effects. The difference in optical path length 
caus es a time-delay between the li ght c urves of each im- 
age. iFalco. Gorenstein. fc Shapiro! (| 19851 ') showed that the 
presence of additional matter (in the form of a mass-sheet) 
along the line of sight has no observable effect except to 
rescale the time-delays and magnifications. As such, it is 
necessary to include Kext in the lens modelling if cosmologi- 
cal parameters are to be est imated accurately and precisely 
from observed time delays u et al.ll2010h . 

If there is external convergence present that is not in- 
cluded in the lens modelling, then the time delay distance 
- inferred assuming Kext = - will be (1 — Kcxt) more than 
the true value of the time-delay distance Djst- 



We can hence estimate the true distance only if we 
have additional knowledge of ^ext. Since K^xt is typically 
small the absolute uncertainty on the estimate of Koxt corre- 
sponds to the fractional uncertainty with which time-delay 
distances can be inferred. 

Dynamical observations of the lens galaxy can help 
break the mass-sheet degeneracy by providing an add i tional 
estimate of the len s's mass (e.g. , iKoopmans et all l2003l : 
[Koopmans 2004; Su vu et al]|201of ). This constraint is use- 
ful in excluding the high convergence tails that are often 



© 2012 RAS, MNRAS 000, UHH 



4 Collett et al 



present in PDFs for convergence - we do not include dy- 
namical constraints in our analysis. 

In this paper we restrict ourselves to reconstructing the 
line of sight portion of the mass distribution in any given 
field, our lines of sight do not include a strong lens which 
allows us to work in the weak lensing regime. 

The impact of weak-lensing perturbations on strong 
lensing lines-of-sight is postponed to further work. 



3 THE MILLENNIUM SIMULATION 

In order to test the accuracy of our convergence estimates, 
we need to know the true convergence for each line of sight. 
We cannot use real lines of sight for this, but must use 
simulated lines of sight instead. In this section we briefly 
review the Millennium Simulation, the ray tracing calcula- 
tions that have been carried out in it, and the mock galaxy 

catalogues that have been produced^ 

The Millennium Simulation (|Springel et al.ll2005h IS a 
cosmological N-body simulation of Dark Matter structures 
in a cubic region approximately 680 Mpc in co-moving size, 
followed from redshift 127 to the present day. With an ap- 
proximate halo mass resolution of 2 x IO'^'^Mq (correspond- 
ing approximately to a galaxy with luminosity O.IL*), it 
provides a detailed prediction for the distribution of dark 
structures present in the Universe, under the assumptions of 
the ACDM model of hierarchical structure formation, cos- 
mological densities as given at the end of Section [T] and 
as, — 0.9. Galaxy properties, such as stellar masses, lumi- 
nosities and colours, were assigned to the simulated halos 
according to a semi-anal ytic model for galaxy formation 
(jPe Lucia fc BlaizotllioOTl l : the resulting predictions for the 
galaxy luminosity function and correlation function match 
the observational data very well. 

3.1 Convergence from Ray Tracing 

iHilbert et al] (|2009l ) calculated the lensing convergence us- 
ing the "lightcone" dark matter only output of the Mil- 
lennium Simulation, providing 1.7 degree square mock sky 
maps of this quantity. In this process, the dark matter 
density from the simulation was projected onto a set of 
lens planes at discrete redshifts, adaptively gridded and 
smoothed, and then the second derivatives of the lensing 
potential required for the components of the magnification 
matrix were computed using a multiple lens plane ray trac- 
ing algorithm. The firs t order approximat ion to this algo- 
rithm (equation 17 of iHilbert et al.l l2009l ) gives the total 
convergence at a given sky position 6 as the simple sum- 
mation of the surface density in each lens plane, weighted 
by the inverse critical surface density Ecr for that plane's 
redshift: 

IHilbert et al] (|2009l ) found that this approximation is accu- 
rate to a percent level in Kext, even on 30 arcsecond scales. 
This accuracy sufficient for our analysis, but the approxi- 
mation may need re-visiting in future work. 

To define a test line of sight, we draw a random sky 
position from within the convergence maps, bi-linearly in- 
terpolate between the pixels of the map, and store the result 
as the "true" convergence for that line of sight, Acjjxt''- 



The adaptive smoothing of the matter in the Millen- 
nium Simulation results in a spatial resolution of ~ 5kpc 
comoving in the densest regions, and ~ lOkpc on average. 
This resolution is su fficient to capture most of the variance 
in the convergence (jTakahashi et all I2OI1I '). Moreover, we 
seek to estimate the component of the convergence that is 
smooth on the angular scales of typical strong galaxy-scale 
lens systems (~ larcsec). Any variations of the external 
convergence on smaller scales can be directly recovered from 
the resulting image distortions in the lens modelling. 



3.2 Mock Photometric Galaxy Catalogues 

We construct mock photometric galaxy catalogues by se- 
lecting all Millennium Simulation galaxies within a circular 
aperture of a given angular radius centred on each randomly 
selected line of sight position, with apparent i-band mag- 
nit ude_briglite£_than a gi ven limit. The parent catalogue, 
from IHilbert et al.l l|201ll ). contains galaxy positions and 
magnitudes, all of which include lensing effects (deflections 
and magnifications). We also have access to the underlying 
galaxy halo masses and redshifts, which we will use to cal- 
ibrate the reconstruction, and also to explore any sources 
of bias and scatter. Since the "true" convergence is that of 
only the dark matter, we focus on reconstructing the dark 
matter halos alone. Our model for doing this is outlined in 
the next section. 



4 ESTIMATING CONVERGENCE: THE HALO 
MODEL APPROXIMATION 

We require a method for estimating the convergence at any 
point on the sky, given a catalogue of observed galaxy prop- 
erties such as positions, brightnesses, colours, and possibly 
stellar masses and redshifts, for every galaxy in the field of 
the lens, down to some magnitude limit. Since we cannot 
observe the surface mass density of each halo directly, we 
need some way of estimating the mass of each halo in the 
catalogue, so that we can compute its con tribution to the to- 
tal K e xt along the line of sigh t (as in e.g. iGunnarsson et all 
I2OO6I : iKarpenka et all 120121 '). This mass assignment recipe 
will be uncertain, and also incomplete, but it can be cali- 
brated with cosmological simulations, which represent a sig- 
nificant additional source of information to what is present 
in the data. 

Transforming an observed photometric galaxy cata- 
logue into a catalogue of halo masses, positions, and red- 
shifts will enable us to attempt a reconstruction of the con- 
vergence induced by every halo near a line of sight. We also 
need to account for the convergence due to dark structures 
and the divergence due to voids; we do not model voids or 
dark structures, but their effects are included statistically 
by calibrating the convergence in halos, Kh, against the ray- 
traced convergence, Koxt, for Millennium Simulation lines of 
sight. 

4.1 Halos 

Cosmological dark matter simulations have shown that dark 
matter halos are reasonably w ell-approximated by NFW 
profiles (|Navarro. Frenk. fc Wh ite 1997). We assume each 
halo to have a spherical mass distribution with density given 
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by 



p{r) 



Po 



{r/rs) (1 + t/ts) 



(3) 



Here, rs is a characteristic scale radius of the cluster, repre- 
senting the point where the density slope transitions from 

to . This radius is related to the virial radius of the 
halo by Ts = r2oo/c, where c is the concentration param- 
eter, which can be estimated from the halo's mass, using 
a mass-concentration relation. Typically more massive ha- 
los are less con centrated, but the re is some scatter; we use 
the relation of iNeto et all (|2007D to estimate c from the 
mass enclosed within r2oo, which we denote as M2 0. We 
find our results do not change if the iMaccio et all (|2008l l 
mass-concentration relation is used instead. 

If one integrates the density profile of an NFW profile 
out to infinite radius, the total mass diverges. Similarly, if 
the universe is homogeneously populated with NFW halos, 
the projected surface mass along any line of sight will also 
be divergent Q Since infinite mass is unphysical, the pro- 
file must be truncated at some point. Several truncation 
profi les have been suggested fe.g lBaltz. Marshall, fc Oguril 
I2OO9I I , but beyond several virial radii, the amount of matter 
associated with a halo is likely to be low. In this work we 
assume the truncated NFW profile 



p{r) = 



Pnfw('') 
1 + {r/nf 



(4) 



which is the same as the NFW profile in the limit 
that the truncation radius, rt goes to infinity; the shear 
and convergence from s uch a profile are derived in 
iBahz. MarshaU. fc Oguril (|2009l l. We use a truncation ra- 
dius of five times the virial radius, but our results are robust 
for any choice of rt > 2 x r2oo ■ 

Many studies have shown that galaxies have total mass 
density profiles that are approximately isothermal in their 
mner regions {e.g. lAuger et al.1 12OI0I I due to the more 
concentrated stellar mass component. This has been con- 
firmed by galaxy-galaxy weak lensing meas urements {e.g. 
iMandelbaum et al.|[2009l : iGavazzi et ahlbOOTl ') . We note that 
the two halo term that dominates the galaxy-galaxy lensing 
signal at large radii is explicitly taken into account in our 
model, where every galaxy is assigned a dark matter halo. 
At large radii an NFW-like profile is probably appropriate, 
but may not be correct when a halo is very close to the line 
of sight. 

Each halo in our catalogue contributes to the line of 
sight convergence by 



(5) 



where Ecr(zi,2s) is the critical s urface density . Foll owing 
the first order approximation of iHilbert et al.l (|2009l 'l out- 
lined in Section |3] above, we compute the total convergence 
from all the halos along the line of sight using 



(6) 



To estimate a halo's contribution to Kh we must first 
know the halo's mass and position. The stellar mass and 



^ At large radius, Snf„ cx: R^^ , but the differential number 
of halos centred within an annulus of width d/J is given by 
dA^annulus R dR, SO Stotal CX Jq°° R.~^ dR, which diverges loga- 
rithmically 



host halo are not necessarily concentric, especially for cen- 
tral galaxies of giant clusters, but our reconstruction as- 
sumes the dark matter halo is centred on the visible galaxy. 
For the lines of sight of the Millennium Simulation, halo 
masses are known, but for real lines of sight the halo masses 
have to be inferred from whatever data are available. In this 
work we investigate the use of the empirical stellar mass - 
halo mass relation for this purpose. The primary advantage 
of this approach is that it is a simple, "one size fits all" 
relation that can be applied regardless of the stellar mass 
observed. We discuss additional observables that could be 
used to improve the precision of the halo mass inference in 
Section |§] below. 

W e use the relation of iBehroozi. Conrov. fc Wechslerl 
to infer halo masses from stellar masses; the details 
of this procedure are outlined in Appendix |X] Given an un- 
certain stellar mass and redshift for each galaxy in the cat- 
alogue, we draw a sample halo mass from the PDF that de- 
scribes this uncertainty (Equation lAlf) and use Equation [5] 
to compute its contribution to the convergence, k^; this can 
be done for all the halos along a specific line of sight and 
summed to give a sample value of kh; repeatedly applying 
this procedure allows us to build up a histogram of Kh values 
consistent with the data, and hence characterize P(Kh\'D). 

This PDF contains a hidden assumption of an uninfor- 
mative prior PDF for Kh - in the case of infinitely poorly 
measured stellar masses and redshifts, P(Kh|f) will have 
very long tails corresponding to very over-dense lines of 
sight that we do not believe exist. The resolution of this 
is to divide out the effective prior PDF that was applied 
during the halo mass estimation process (which is broad, 
and hence close to uniform), and apply an additional prior 
on Kh, given by the global underlying Kh distribution from 
the simulation. In the limiting case of very poor photometric 
data, P(Kh|f) then defaults to this distribution, as required. 

4.2 Accounting for voids, filaments and other 
dark structures 

Our halo model accounts for the convergence contribution of 
density perturbations that can be associated with light. Fil- 
aments and dark substructures contribute additional con- 
vergence, while voids contribute a divergence; these struc- 
tures must be included to produce an unbiased estimate of 
Kcxt. In particular, neglecting voids would lead to a heav- 
ily biased estimate of Koxt, since the halo model's Kh can 
only be positive. In principle the absence of galaxies im- 
plies something about the presence of a void, but we do not 
currently attempt to use this information. In general, it is 
difficult to account for the unseen mass, since we do not 
have a good model for its density structure. 

The solution we present here is to calibrate the rela- 
tionship between the halo convergence Hh and the overall 
convergence Kext (as would be calculated by ray tracing 
through the full density field) using the simulations them- 
selves. The halo modelling procedure outlined in Section fOI 
allows us to estimate the convergence due to halos given ob- 
servations of mass, redshift and position, P{Kh\T>), but what 
we are interested in is the total convergence along the line 
of sight P(Kcxt|0). We can obtain this by considering the 
expression: 

Pr(Kext|I?)= J dKhPr(Kext!Kh,T?)Pr(Kh|I)) (7) 

The first term in the integrand relates the convergence due 
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to model halos, Kh, to the true convergence, Kext- This con- 
ditional distribution can be constructed horn the Millen- 
nium Simulation catalogues, by computing Kh from the true 
halo masses and redshifts along each selected line of sight, 
and then accumulating Kh, Kcxt pairs for a large number of 
such sightlines. For any given line of sight we can then esti- 
mate P(Kh|X') as described above, and then multiply it by 
Pr(Kcxt|«:h) and integrate out the intermediate parameter 

However, if the conversion between stellar mass and 
halo mass is very uncertain, P(Kh|D) is shifted relative to 
what it would have been given perfect knowledge of the halo 
masses. This is because the conversion from stellar mass to 
halo mass and into is highly asymmetric: a long tail of 
high halo masses gives rise to a tendency to overestimate 
Kh. If this shift is ignored, the resulting Pr(Koxt|I') will be 
systematically biased towards high values of Koxt- Instead, 
we need the distribution of ftcxt from all calibration lines of 
sight that have a P(Kh|2?) that is identical to the P(Kh|f ) for 
the real line of sight given the same data quality; however, 
we found this approach infeasible given our finite number 
of calibration lines of sight. A working compromise is to use 
the median of the V{k^\T>) distributions in the calibration. 

We take a large number of calibration lines of sight 
from the Millennium Simulation catalogues and generate 
a mock stellar mass catalogue for each one. We then re- 
construct each lightcone in the same way as we would an 
observed field, estimate P(fvh|C), and extract its median. 
Accumulating these median values k™*"^ and their corre- 
sponding true values of Kext, we can form the conditional 
distribution P(Kext|«;™'^'*). Then, to infer Kext from an ob- 
served photometric catalogue, we estimate P(Kh|I') using 
the halo model, compute the median value K™obs5 s-^d then 
use a modified version of Equation [7] to infer Kcxt : 



P/ I mcd 

r(Koxt|Kh ■ 



V) 



d^r^' pr(KextiK^^©) vrit^r^m 



(8) 



This integral is trivial, since Pr(K™°'^|D) is a delta function 
centred on the median of the inferred PDF for Kh. However, 
because we only have a finite number of calibration lines 
of sight, we use all calibration lines of sight with K™cai ~ 
Kr'' ± 0.003 to form Pr(Kext|Kr'*, D). 

This procedure of reducing the full PDF for Kh to its 
median increases the importance of the realism of the sim- 
ulation: the simulation needs to be as similar as possible to 
the real universe or there is a possibility of systematic errors. 
Conversely, making use of all the information in the photo- 
metric catalogue yields Kext estimates of increased precision. 



5 RESULTS: PERFECT HALO DATA 

We now put the reconstruction procedure outlined above 
into practice, first inferring Kcxt from Kh in the case of hypo- 
thetical galaxy catalogues with noiseless redshifts and halo 
masses. The reason for doing this is to check the validity 
of the calibration process, and then also to investigate the 
primary sources of the convergence. It will also provide us 
with a measure of the intrinsic uncertainty introduced by 
our assumptions of all illuminated halos being spherically- 
symmetric truncated NFW halos following the Neto et al 
concentration-mass relation and of visible galaxies being 
concentric with their host halos. 




Figure 2. The conditional distribution of Koxt given Khi in the 
case where we have perfect knowledge of halo mass and redshift. 
10^ reconstructed lines of sight were used to make this plot. Solid 
(dashed) lines enclose 68% (95%) of the conditional probability. 
Kh traces Koxt, but with a positive offset which arises from Kh 
not accounting for any voids or dark structures. At fixed Kh the 
scatter in Kcxt grows rapidly with Kh. The thin diagonal line 

follows Kcxt = Kh. 



Sum of Reconstructed P(;i ) 

Global P(k) 




Figure 3. Recovering the global P(Kcxt) (shown in red). The 
black curve shows a histogram of inferred Kext values from 10^ 
reconstructed lines of sight. The reconstructions were performed 
given perfect knowledge of halo mass and redshift — in this case 
the reconstruction recovers the correct convergence. 



5.1 How is Kh related to Kext? 

Figure [2] shows the conditional distribution of Kcxt given Kh, 
derived from lO'' randomly selected Millennium Simulation 
lines of sight and assuming noise-free halo masses and red- 
shifts. In this case, P(Kh|I') is almost a delta function since 
the only uncertainty is halo concentration, which has mini- 
mal effect on P(Kh|I'). At fixed Kh we find in Figure [2] that 
the scatter in Kext grows with Kh; our reconstruction is bet- 
ter at reproducing under-dense lines of sight than over-dense 
lines of sight. The effect of ignoring voids and the smooth 
mass component is evident in this plot: Kh is significantly 
higher than Kcxt, at any given Kcxt value. Conditioning on 
Kh gives a PDF for Kext centred at the correct place, by 
construction. 



5.2 How accurate is the Kh — Kext calibration? 

Let us define the "bias" of the reconstruction of a given line 
of sight as the difference between the expectation value of 
Kext over its posterior PDF P(Kext|I') and the known true 
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Figure 4. Which objects dominate the external convergence for a line of sight? Prom left to right, the three figures show the cumulative 
contribution to the total convergence from individual halos (fth) as a function of 1) distance from the line of sight, 2) magnitude and 
3) redshift. In each panel, the solid line is the median cumulative contribution over a large sample of lines of sight, while dashed and 
dot-dashed show the ranges enclosing 68 and 95 percent of the lines of sight respectively. The source is at redshift 1.4. 



value ftoxt". Let us also define the "width," CTk, to be half 
the width of the interval containing the central 68% of the 
posterior probability P(/s;cxt|I'). 

In an ensemble of 10^ reconstructed lines of sight, we 
find the mean bias to be —1 x 10~^, and the mean width 
to be 0.01. This is 1.8 times smaller than the width of the 
global P(Kcxt). We find that our inferred ftext PDFs are 
consistent with the global Koxt distribution; Figure |3] shows 
that sampling from the P(Kcxt|I') inferred from reconstruc- 
tions of 10^ lines of sight produces a distribution of Koxt 
values over the sky nearly identical to the global P(Ai;oxt). 
Given perfect knowledge of halo mass and redshift then, the 
calibration procedure provides an unbiased estimate of Kcxt 
that is 1.8 times more precise than using the global P(Koxt). 
This is the maximum precision we can obtain using the cur- 
rent calibrated halo model. 



5.3 Which halos dominate the Kh distribution? 

Before investigating the impact of imperfect knowledge of 
halo mass and redshift, we investigate the uncertainties in- 
duced by limits on the magnitude of the observed galaxy 
sample, and on the field of view. The halos in our catalogue 
are populated by galaxies and given magnitudes accordin g 
to the semi-analytic model of De Lucia fc Blaizod |20o3): 
by applying magnitude cuts to our catalogue, we can inves- 
tigate the amount of scatter caused by unobserved halos. 

Figure |4] shows the cumulative contribution to 
function of projected distance from the line of sight, mag- 
nitude and redshift. We find that most of the convergence 
comes from halos close to the line of sight. Over half of 
the convergence typically comes from halos within 30 arc- 
seconds of the line of sight; by 2 arcminutes, this fraction 
is 85%. The contribution from halos beyond 2 arcminutes 
is relatively constant at a contribution of 0.008±0.003. We 
find that ignoring halos beyond 2 arcminutes has no ef- 
fect on the precision of the reconstruction; this is shown 
in Figure [S] As a function of magnitude, we find that Kh 
is dominated by objects with magnitudes between i = 18 
and i = 24. Objects brighter than i = 18 are either too 
rare, or too close to the observer, to make a significant con- 
tribution to the convergence. Objects fainter than i = 24 
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Figure 5. The 16, 50 and 84th percentiles of kjj minus Kcxt 
as a function of the limiting radius of the halo reconstruction, 
/tjj has been shifted such that (kij) = to emulate the effect of 
the eventual calibration. The majority of the constraining power 
comes from reconstructing halos within 2 arcmin of the line of 
sight. 

are too small to be important, unless they are extremely 
close to the line of sight; we find that including halos fainter 
than i = 24 does not improve the reconstruction. Figure [5] 
shows the scatter on Kh — text (where fth is shifted so that 
its mean is zero, to emulate the effect of the eventual cal- 
ibration) as a function of reconstruction magnitude limit. 
We do not expect a deeper survey to decrease the size of 
the uncertainty in mapping Kh onto Kext. Halos at all red- 
shifts out to the source redshift (1.4 in this work. The red- 
shift of the source in B1608+656) contribute to the conver- 
gence, but the largest contribution comes from halos with 

^ ~ 2sourcc/2. 



6 TESTING THE HALO MODEL 

RECONSTRUCTION ON MOCK GALAXY 
CATALOGUES 

We now move on to consider the halo model reconstruction 
of line-of-sight mass distributions given noisy astronomical 
observables. A typical imaging survey can be expected to 
provide measurements of the positions and magnitudes of 
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Figure 6. The 16, 50 and 84th percentiles of ftjj minus Kcxt as a 
function of the Hmiting i band depth of the halo reconstruction. 
Kjj has been shifted such that (kji) = to emulate the effect of 
the eventual calibration. The majority of the constraining power 
comes from reconstructing halos with magnitudes between 18 < 
i < 24. 

galaxies in a field; spectra for some of the objects may either 
come from a synergistic survey, or from targeted follow- 
up. In this section we quantify the uncertainties induced by 
inferring the halo mass and redshift from these observables. 

As described in Section 14.11 in this work we attempt 
to infer halo masses from measurements of stellar mass. We 
will investigate two main sources of uncertainty: the stellar 
masses themselves, and placing halos at photometric red- 
shifts. Much work has already focus ed on using photo met- 
ric colours to infer stellar mass {e.g. lAueer et al1l2009l ') and 
redshifts {e.g. Bcm'tcz 2000): we will estimate the likely un- 
certainties on stellar mass and redshift based on this work. 

6.1 Making Mock Observational Catalogues 

While the Millennium Simulation catalogues do already 
contain stellar masses for each galaxy, we do not use them 
for two reasons. The first is that at the low mass end, the 
dark matter-only Millennium Simulation satellite galaxy 
halos are highly stripped relative to what we might expect 
in a universe containing baryons, leading to a mismatch be- 
tween the observed and simulated stellar mass-halo mass 
relations. The second is that we wanted to be able to per- 
form the functional test of trying to recover the conver- 
gence having assumed the correct stellar mass-halo mass 
relation. For these reasons, we assign a new true stellar mass 
to each halo in the Millennium Simulation catalogues, ac- 
cording to the empirical stella r mass -halo mass relation of 
iBehroozi. Conrov. fc Wechslej (|2010h . When assigning stel- 
lar masses we have treated satellites in the same way as 
central halos; in the real universe both the density profile 
and the stellar mass-halo mass relation are likely different 
for satellites and centrals, but our simple model does not 
include these effects. From these we simulate observed stel- 
lar masses by drawing samples from P(logM*j3g| log Af*ruo) 
which we take to be a Gaussian of width gm, centred on 
log(Af*truo). Where a spectroscopic redshift exists, stellar 
mas ses can be estimat ed with typical uncertainties of 0.15 
dex (|Auger et al.ll20o"9h ; however with photometric redshifts 
stellar mass uncertainties are typically three times as large. 
We use gm, = 0.15 dex for halos with a spectroscopic 
redshift and aM, = 0.45 dex otherwise. For photometric 
redshift uncertainties we draw a redshift from P(2:truc|2obs) 




Figure 7. The widths of the inferred PDFs P{Kc^t\T>) for lO'^ 
lines of sight, given different quality of data. Blue: spectroscopic 
redshift for every halo with i < 26; Red: spectroscopic redshift 
for every halo with i < 23, and every halo with i < 24 within 
1 arcminute, while all other objects just have photometric red- 
shifts; Green: all the objects in the field have only photometric 
redshifts. The vertical dot-dashed line marks the width of the 
global P(Kcxt) for all lines of sight. 



which we take to be a Gaussian of width 0.1(1 + ^spoc) cen- 
tred on 2:bpcc, where Zspec is the halo's true redshift in the 
Millennium Simulation catalogue. In our mock catalogues 
we have used the galaxy position from the Millennium Sim- 
ulation; this is not necessarily coincident with the centre of 
the galaxy's host dark matter halo. 

Given an uncertain stellar mass and redshift it is 
possible to infe r a halo mass using the stellar mas s-halo 
mass relation of lBehroozi. Conrov. fc Wechslei) (|2010l l. This 
procedure requires an inversi o n of the relation given in 
iBehroozi. Conrov. fc Wechsleij (|2010l ) and correctly invert- 
ing the relation's uncertainties requires care: the procedure 
we use to do this is given in Appendix [X] By drawing sam- 
ple halo masses and redshifts, we can infer a sample Kh 
using the procedure of Section [l] Repeatedly drawing sam- 
ples allows us to estimate P(fi:h|I') for each reconstructed 
line of sight. We then transform this into our target PDF 
P(Koxt|2?) as described above. 



6.2 Reconstructing Kcxt given a spectroscopic 
redshift for every object 

The best possible reconstructions of Kext will come from 
having a spectroscopic redshift for every single object in 
each field. Figure [7] shows the distribution of Koxt posterior 
PDF widths for various data qualities: complete spectro- 
scopic coverage results in the smallest widths. In terms of 
telescope time, such a reconstruction would likely be pro- 
hibitively expensive, but we investigate this scenario as an 
ideal case. 

Reconstructing the lines of sight with perfect knowl- 
edge of the redshift but an uncertain stellar mass, we find 
that the width of P(Kext|C) grows with the expectation of 
Kcxt and the ray-traced Ktrue (Figure [S]). The low Kcxt lines 
of sight are relatively empty, and so there are few oppor- 
tunities for uncertainties in the halo masses to propagate 
into Koxt uncertainties; low Koxt lines of sight can be recon- 
structed more precisely than high Kcxt lines of sight. 
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Figure 8. Width of the inferred PDF P(Kext|I') as a function 
of the true convergence, Koxt'^i different data quahties. 

Black assumes only photometric redshifts for all objects, while 
red assumes a campaign of targeted spectroscopy. The region 
between the solid (dashed) lines contains 68% (95%) of lines-of- 
sight. 



6.4 Reconstructing Kcxt with partial spectroscopic 
coverage 

While a fully photometric reconstruction provides useful 
constraints on P(Kcxt|I'), targeted spectroscopy can pro- 
vide additional constraints on the masses and redshifts of 
halos whose Kh contributions have the largest absolute un- 
certainty. Obtaining spectra of bright (i < 22) objects is 
relatively fast with large telescopes: however, we find that if 
spectroscopic redshifts were known for all i < 22 galaxies in 
our fields the reconstruction improves only slightly over the 
purely photometric reconstruction, although the improve- 
ment can depend strongly on the details of the particular 
line of sight. Obtaining spectra for fainter objects would be 
correspondingly more expensive, but if spectroscopic red- 
shifts could be obtained for all i < 23 galaxies (as part of 
a futuristic baryon acoustic oscillation survey, for example) 
and all i < 24 galaxies within 1 arcminute of the line of 
sight, then we find that the P{Kcxt\'D) would be almost as 
precise as that from having complete spectroscopic cover- 
age (Figure [T]!. If any object is extremely close to the lines 
of sight our approximation of weakly lensing mass-sheets 
will break down; also, neglecting baryonic effects will also 
be a poor approximation; a spectrum will be needed to ad- 
equately model systems with very well aligned pertrubers. 
Consistent with our previous findings, the lowest ftoxt lines 
of sight have the most constrained ftoxt PDFs. 



6.3 Reconstructing Koxt from photometry alone 

Inferring the stellar mass of a galaxy from its magnitude and 
colours requires an estimate of how far away the galaxy is; 
without a spectroscopic redshift the inferred stellar mass is 
less precise. However, obtaining photometry has much lower 
observational cost; all upcoming large area photometric sur- 
veys will reach 24th magnitude, providing sufficient data to 
reconstruct lines of sight without additional observations. 
In principle the photometric redshift is correlated with the 
inferred stellar mass, however we do not model this effect 
since the convergence from the outskirts of an individual 
halo is only weakly dependent on redshift at fixed mass due 
to the breadth of the lensing kernel: the redshift uncertainty 
has only a small efi'ect on the inferred P(Kcxt|I') in compar- 
ison to that of the uncertain stellar masses. With only pho- 
tometric redshifts, the uncertainty on Kh is larger than in 
the spectroscopic case, and this propagates into a broader 
P(Kcxt|I'), as can be seen in Figure [T] However, the pho- 
tometric reconstruction still typically produces a 50% im- 
provement compared to the precision of the global P(Koxt). 

With photometric data alone, K^""^ can shift by ~0.01 
relative to k™*"* given spectroscopic coverage. Since the Koxt 
contribution of voids cannot change with data quality, the 
shift in must be due to the asymmetric propagation 

of stellar mass uncertainties into Kcxt uncertainties. As the 
stellar mass uncertainty increases, k^^'^ is pushed higher. 
To correctly calibrate P(Koxt|Kh'°'*, one must include the 
data quality used to generate k™''^'. 

We find that the width of P(Koxt|2?) grows with the 
expectation value of Kcxt ; this is shown in Figure [S] The 
low Koxt lines of sight are relatively empty, and so there are 
few opportunities for uncertainties in the halo masses to 
propagate into Kcxt uncertainties. 



7 SYSTEMATIC ERRORS DUE TO SAMPLE 
SELECTION 

While it is good to reconstruct Koxt precisely, it is more 
important that ftoxt is reconstructed accurately. The recon- 
struction must not be biased. We quantify the accuracy of 
our reconstructed P{Kext\'D) in the following way. By shift- 
ing the inferred P(Kext|25) for each line of sight by the true 
(ray-traced) convergence, we obtain a PDF that should be 
centred on zero; these offset PDFs can be multiplied to- 
gether for multiple lines of sight to emulate a joint likelihood 
analysis, and then test for possible bias in it: 



rN=Y[P^{ 



true 



\V) 



(9) 



In this section, we quantify the bias as the size of the de- 
viation in the expectation value of Vn from zero. If the 
bias of Vn is smaller than the half- width of Vn, our Koxt 
inferences can be considered accurate. Since the width of 
Vn decreases as more sightlines are combined, a small bias 
will always eventually be found. In the context of measuring 
time-delay distance at the one percent level, the expecta- 
tion value of \Vn\ needs to be significantly less than 0.01. 
External convergence is not the only source of uncertainty 
in measuring time-delay distance. Since the statistical dis- 
tance uncertainties due to lens modelling and time delay 
estim ation are likely to be at the 3-4% level (|Suvu et al.l 
l2010h . convolving Vn with a Gaussian of width O.OA/y/N is 
a reasonable approximation for the likely final uncertainty 
on time-delay distance. 

The source of systematic error that we investigate in 
this paper is that of sample selection: we define three dif- 
ferent example selections of lines of sight, and compute the 
bias that would result (if left unaccounted for) in each one 
in turn, as a function of data quality. The results described 
below are illustrated in Figure [S] 
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Figure 9. Accuracy in Kext from combining samples of lenses. These plots show the expectation value of rii^i Pi{i^cxt — ''^oxt^l^) 
- deviations from zero represent biases. The solid, dashed and dotted lines correspond to combinations of 20, 5 and 2 sightlines 
respectively. Black lines show the results inferred from photometry alone, whilst red lines show the results from the same targeted 
spectroscopy campaign described in Section 16.41 The three panels correspond to samples of sightlines selected in different ways. Left: 
randomly selected lines of sight. Centre: sightlines randomly selected from the 33% of lines whose P(Kext) is most tightly constrained 
by our model. Right: lines of sight with external shear of 0.05 or greater. 



7.1 Selecting random lines of sight 

We first consider a purely random selection of lines of sight. 
With the purely photometric reconstruction, there is no ev- 
idence of bias, even after combining 20 lines of sight. This 
result is a sanity check: if we had chosen lines of sight with 
identical k™'^'* rather than fi:™°'* ± 0.003, randomly selected 
lines of sight would have zero bias by construction. If lines 
of sight hosting time delay lenses are truly random, our 
method allows Kext to be reconstructed without inducing a 
bias (under the basic assumptions that the universe is like 
the calibration simulation, and the correct stellar-mass to 
halo-mass relation has been used). 

7.2 Selecting only the lines of sight with narrow 

P(«:ext|©) 

Since follow-up campaigns (such as high resolution imaging, 
or time variability monitoring) are expensive, it is likely that 
only a subset of detected objects will be observed further. 
By pre-selecting objects that have the best-constrained con- 
vergence, the cosmological value per lens can be increased. 
However, this selection (like any other) has the potential 
to induce a bias. It is likely that only photometry will be 
available at the time of sub-sample selection (although spec- 
troscopic follow-up might be conducted at a later date). We 
mimic such a selection by drawing lines of sight only from 
those in the lowest third of (Jk, given a purely photometric 
reconstruction of their fields. 

We find that selecting the most tightly constrained lines 
of sight in this way does not introduce a significant bias: our 
reconstruction is most accurate for the lowest Kcxt lines of 
sight. Combining 20 such systems, the bias is at the 0.001 
level. 

Photometry of the field seems to be sufficient to ad- 
equately select these empty lines of sight. However, since 
lenses are typically massive galaxies and hence found in lo- 
cally over-dense environments, it is unlikely that an almost 
empty line of sight would actually contain a lens; what we 
can say is that if the local environment can be identified 
spectroscopically, and then the line of sight mass distribu- 



tion reconstructed using our method, selecting lenses with 
low, well-constrained values of Kcxt will lead to an increase 
in distance precision at no cost in bias. 



7.3 Selecting only high shear lines of sight 

Time delay lenses are often selected for follow-up based on 
their images' brightness (to make monitoring observations 
less expensive), their images' separation (to decrease the 
covariance between the ground-based image light-curves), 
or the fact that they are quadruply-imaging rather than 
doubly-imaging (since this yields 3 time delays instead of 
1, a higher magnification and a more informative Einstein 
Ring system). All three of these properties favour lenses 
with high convergence and shear along their lines of sight. 
Focusing on the quad selection, we might expect many of 
these systems to have significant external shear (since the 
cross-section for 4 image production is so sensitive to ellip- 
ticity in the lens mass distribution (or equivalently, external 
shear due to the lens' environment). 

We can test the impact of such a shear selection by 
selecting only lines of sight with an external shear above 
some threshold. We define a somewhat extreme selection 
of 7Gxt > 0.05, and find that in this sample, Kcxt would 
be systematically underestimated at the ~0.008 level (cor- 
responding to a 0.8% systematic error in distance). In the 
absence of other sources of uncertainty, this systematic error 
would be significant in a sample of just 5 lenses; when in- 
cluding other time-delay distance uncertainties a 0.8% bias 
would be significant for ~20 lenses. This systematic error 
is mitigated if only the lines of sight with both 7oxt > 0.05 
and well constrained P(fi:extjl') are used, but in this case a 
bias at the ~0.005 level remains. 

While 7(.xt > 0.05 is likely to be a much stronger se- 
lection than would occur in reality, it is nevertheless worth 
noting from this example that the reconstruction procedure 
can be biased if lens selection functions are extreme and un- 
accounted for. Since our model does not include shear con- 
straints it is not surprising that a selection function based 
on shear can induce a bias. A more sophisticated model that 
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includes the shear recovered from the lens modelling might 
be less susceptible to this bias. 

The halo model can also be used to estimate the ex- 
ternal shear along a line of sight; shear is an observable 
that can be extracted from strong lens modelling. How- 
ever, there is a degeneracy between internal and external 
shear. When the Einstein Ring imaging data are very good 
it is possible to disen tangle external and internal shear 
{e.g. ISuyu et al.ll2010l). but there are still significant uncer- 
tainties. Wong^ al.l ( 201lh attempted to match the shear 



from strong lens models with a reconstruction of the lo- 
cal lens group environment, but found a tension between 
the strong lens mo del and the r econs truction of the envi- 
ronment. Given the I Wong et al.l (|201lh results, it is unclear 
whether the external shear from lens models can be rec- 
onciled with a line-of-sight reconstruction. Alternatively, it 
may be possible to infer external shear using weak lensing 
information from near the line of sight. If the true external 
shear can be measured, it provides an additional constraint 
on which of the Millennium Simulation lines of sight are 
similar to the reconstructed line of sight. [Suvu et alT(|2012l l 
found that in the case of RXJ1131-1231, combining shear 
constraints with galaxy number count over-density gave a 
significantly different P(Koxt|7, Af45) compared to the PDF 
from number count over-density alone, P(«:cxt|A^45). 

Extending our model to include shear we find that given 
perfect knowledge of the halo mass and redshift the ray- 
traced external shear 7 and the reconstructed external shear 
7h are similar, with 68 percent of lines obeying 



!7cxt -7hi < 0.025 



(10) 



Future work should investigate whether 7h can be used to 
improve the accuracy and precision of Kcxt estimation, given 
a reconstruction of the line of sight. 



8 SYSTEMATIC ERRORS DUE TO 

ASSUMING AN INCORRECT STELLAR 
MASS-HALO MASS RELATION 

Throughout this work we have assumed that the universe's 
halos are populated with galaxies whose stellar masses 
are de termined purely by the lBehroozi. Conrov. &i Wechslen 
Stellar Mass-Halo Mass relation, which we then use 
in our reconstruction. In practice, the true Stellar Mass- 
Halo Mass relation may well be different to the one we as- 
sume in our inference, and if it is, a systematic error could 
be incurred. It is hard to test the size of this systematic 
error, since we do not know how much the real universe 
differs from the chosen Stellar Mass-Halo Mass relation. 
As wider and deeper surveys are conducted more data will 
become available with which to construct the Stellar Mass- 
Halo Mass relation; this should drive the inferred Stellar 
Mass-Halo Mass relation closer to the truth. We are only 
interested in testing the effect of changing the Stellar Mass- 
Halo Mass relation in a way that is consistent with observa- 
tional constraints; improving the observational constraints 
on the Stellar Mass-Halo Mass relation will decrease the 
the potential size of the systematic error on reconstructed 
Kext induced by assuming a specific Stellar Mass-Halo Mass 
relation. 

To estimate the potential size of this systematic error, 
we repeat the analysis of the previous section, still using the 
the Behroozi et al. relation to generate the stellar masses 
of our calibration lines of sight and hence carry out the 



inference, but now using simulated catalogue datasets that 
were generated from the stellar mass-halo mass relation of 
Moster et al. 

This relation provides a comparable fit to the stellar 
mass function to the Behroozi et al relation, so represents a 
plausible alternative to our assumed form. For halo masses 
of ~ 10^^ Mq, both stellar mass-halo mass relations predict 
a stellar mass of ~ 10^"- '^Mq, but for halos more massive 
than this the stellar masses generated from the Moster re- 
lation are systematically higher than those generated from 
the Behroozi relation. At halo masses of ~ IO^^Mq the pre- 
dicted stellar masses differ by some 0.25 dex. 

Applying our Behroozi-based reconstruction to lines of 
sight with Moster stellar masses results in systematic over- 
estimates of Kcxt. In Figure [To] we show this bias emerging 
after combining several lines of sight. With a purely pho- 
tometric reconstruction there is a typical systematic bias 
of ~ 0.025 on the reconstructed convergence. This can be 
shrunk to ~ 0.01 if only the low Kcxt lines of sight are used; 
however for the high shear lines of sight the bias is ~ 0.04. It 
is not too surprising that the low Koxt lines of sight are least 
affected by changes in the stellar mass-halo mass relation. 
Low Kext lines of sight are relatively empty, their Kh values 
are small, and they do not depend as strongly on the stellar 
mass-halo mass relation. In contrast, the high shear lines 
of sight have significantly overestimated Kcxt values, since 
they tend to lie close to massive halos - the regime where 
the Moster and Behroozi relations are most different. In- 
terestingly, overestimating the fth values of high shear lines 
of sight pushes them into the region where the Kh to Kext 
calibration is least certain; this significantly decreases the 
precision of the reconstructed Kcxt- 

Despite these systematic errors, we find that the pho- 
tometric reconstruction is still sufficient to choose the best 
lines of sight for spectroscopic follow-up. With targeted 
spectroscopy the systematic error decreases to ~ 0.014 for 
the ensemble, ~ 0.005 for the low kappa sample and ~ 0.018 
for the high shear sample. These results provides further 
motivation for prioritizing the lenses that reside on the 
under-dense lines of sight. 

The systematic errors reported here are perhaps overly 
pessimistic, because the bias is due to differences in the 
stellar masses predicted for high mass halos. If new obser- 
vations can discriminate between the high mass end of the 
Behroozi et al. and Moster et al. relations, then the sys- 
tematic error on the reconstruction will decrease. Similarly, 
incorporating additional information about the high mass 
group and cluster-scale halo systems, such as their richness 
or other occupation statistics, would help mitigate the stel- 
lar to halo mass conversion error. 



9 DISCUSSION 

The total convergence along a line of sight is strongly 
correlated with the reconstructed Kh- However, since our 
model ignores voids and assumes all halos follow a spher- 
ical truncated-NFW profile our halo model does not in- 
clude all of the relevant physics; hence, the width of 
our resulting P(Avext|C) is still typically ~0.01 for any 
given lightcone, even assuming perfect knowledge of every 
halo's virial mass and redshift. To make further progress a 
more ad vanced treatment of b oth voids and halos will be 
needed. iKarpenka et al ] l|2012l l find that their supernova 
data prefer, in the context of their simple halo model, a 
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Figure 10. Systematic bias in Kext due to reco nstructing lines of s ight with the wrong stellar mass-halo mass relation. The real 
st ellar masses were created usin g the relation of iMoster et alj ll201Cll l , but the reconstruction and calibration assumes the relation 
of iBehroozi. Conrov. fc Wechsleil ll201Cll V These plots show the expectation value of rifci Pi(i^ext — '^ext'^i^) ^ deviations from zero 
represent biases. The solid, dashed and dotted lines correspond to combinations of 20, 5 and 2 sightlines respectively. Black lines show 
the results inferred from photometry alone, whilst red lines show the results from the same targeted spectroscopy campaign described 
in Section 16.41 The three panels correspond to samples of sightlines selected in different ways. Left: randomly selected lines of sight. 
Centre: sightlines randomly selected from the 33% of lines whose P(K;ext) is most tightly constrained by our model. Right: lines of sight 
with external shear of 0.05 or greater. 



truncated isothermal profile for the galaxy mass distribu- 
tions. Galaxy-galaxy and cluster-galaxy lensing studies (e.g. 
Gavazzi et al.l 120071 : Ijohnston et al.l l2007l : iLagattuta et al.l 
also suggest an approximately isothermal profile for 
the mass-shear correlation function; however, at large radii 
these profiles are dominated by the effects of neighbouring 
halos, which are already taken into account by our model. 
Further iterations on the analysis presented here should in- 
clude the stellar mass distributions that cause the isother- 
mal density profiles in the centres of galaxies, although we 
do not expect this to have a big impact on the reconstruc- 
tions. Likewise, halo ellipticity may have some small effect 
on the predicted convergence. It is possible that baryonic 
physics may alter the radial profile of dark m atter halos, 
even on large scales. ISemboloni et al.l (|2012l 'l found that 
baryonic physics had a 30-40% impact on three-point shear 
statistics, it is unclear how baryons affect one-point conver- 
gence statistics compared to the pure dark matter simula- 
tions used in this work. 

Clusters, filaments and voids are more difficult to 
model, but simple forms could in principle be derived from 
the structures found in the simulations, and included in the 
halo model in some way. Most importantly for the high- 
est Kex t lines of sight will be a b etter group a nd cl uster 
model: iMomcheva et aD (120061 1 and I Wong et all (|201lh rec- 
ommend using models where dark mass is assigned to both 
galaxy-scale halos and halos for the groups and clusters they 
occupy. To make this work, one could incorporate a group 
and cluster fin der, and then infer halo mass from optical 
richness {e.g. iKoester et al.l [20071 ) in tandem with BCG 
stellar mass. A group finder would also allow us to iden- 
tify central and satellites galaxies; in this work we have ne- 
glected the differences between central halos and satellites. 
However, given perfect knowledge of halo masses we found 
that only in 5 percent of sightlines do satellites contribute 
over 20 percent of the sightline's total Kcxt- Introducing 
a group finder is unlikely to provide major improvement 
to the reconstruction of satellites. Meanwhile, improving 
the modelling of voids could be done using suggestions by 



iKrause et ej] (|2012l ') or lHiguc hi et al.l ll2012h corn bined with 
a void finder such as in [Foster fc NelsonI ||2009D . Combin- 
ing an advanced halo model with a void model will allow a 
direct test of the halo approximation: using real photomet- 
ric data one could reconstruct lines of sight and compare 
against measured weak lensing shears. This would ensure 
that the reconstruction model reproduces the contents of 
the real universe, rather than a particular simulation. 

The reduction of P{Kh\T>) to just its median value, 
before using it to look up the appropriate P(Kcxt) from 
the simulated sightli ne ensemble , is rem inisce nt of the pro- 
cedure followed by ISuvu et"al ] (|2010h . and [c reene et al.l 
(|2013D . The reconstruction method presented here can be 
seen as the limiting case of the weightings explored by 
Greene et al: rather than using weighted number counts 
in an aperture, we treat each galaxy individually, and pre- 
dict K™obs for use in their place. Given photometry alone, 
our reconstruction represents a ~50% improve ment of the 
precis ion compared the number counts used in ISuvu et al.l 
(|2010h . Greene et al. made important progress for the most 
over-dense lines of sight; our reconstructions are ~30% more 
precise than those of Greene et al. The use of relative over- 
densities by Suyu et al. and Greene et al. may decrease 
their sensitivity to the choice of calibration simulation; this 
work's use of absolute convergence may be less robust to 
changes in the calibration simulation. 

In this work, we have used the Millennium Simulation 
as a calibration tool. If the real Universe is not like the Mil- 
lennium Simulation, then this calibration will be incorrect. 
Repeating this study using a different simulation to make 
the test catalogues would allow us to quantify the sensi- 
tivity of our results to the simulation used. Similarly the 
use of a different semi-analytic model to paint galaxies onto 
halos would enable simil ar exploration of these systematic 
errors. ISuvu et al.l (|20ld 'l used the relative over-density to 
mitigate the effect of simulation and simulation, but it does 
not completely remove the dependence on simulation. We 
have tested the effect of using a different stellar mass-halo 
mass relation and found that this can induce significant sys- 
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tematic biases if the true relation is unknown. Future work 
should include investigating the robustness of the recon- 
struction to these systematic errors and develop prescrip- 
tions that mitigate against them 



10 CONCLUSIONS 

In this work we have investigated a simple halo model pre- 
scription for reconstructing all the mass along a line of sight 
to an intermediate redshift source. We have used the ray- 
traced lensing convergence along lines of sight through the 
Millennium Simulation to calibrate estimates of the total 
convergence made by summing the convergences due to each 
object in a photometric catalogue. Having found that the re- 
construction process is accurate given this calibration and 
perfect knowledge of halo mass and redshift, we investi- 
gated the effects of reasonable uncertainties in the stellar 
mass and redshift of each halo, and propagated these un- 
certainties into a P{ncxt\'D) for each line of sight. We also 
defined three different possible line of sight selections, and 
investigated the possible bias induced by these selections. 
We draw the following conclusions: 

• Despite our model's simplicity, its reconstructed Kh val- 
ues are good tracers of ftoxt- is biased due to our igno- 
rance of voids, groups and clusters, and the unseen mass in 
small halos and filaments, but we find that this can be cali- 
brated out using the Millennium Simulation catalogues. We 
found that with perfect knowledge of every halo's mass and 
redshift, calibrated reconstruction of a typical line of sight 
gives an unbiased estimate of Kext that is not perfect, but is 
1.8 times more precise than the global P(Koxt). This factor 
quantifies the limit to which the halo model can represent 
the external convergence due to line of sight mass structure. 

• With uncertain halo masses and redshifts, we find that 
P(Kh|I?) can still be calibrated in order to infer P(fi:oxt|2?) 
from the ensemble of simulated lines of sight, but the re- 
sulting PDFs tend to be much broader than for perfect halo 
mass and redshift reconstructions. 

• It is very rare for halos further than 2 arcminutes 
from the line of sight to make a significant contribution to 
P(Kext|I'); likewise, including halos whose host galaxy is less 
luminous than i = 24 does not significantly improve our re- 
constructions. A photometric survey to this depth of a 4x4 
arcminute patch around the target would approach the lim- 
iting uncertainties of our simple reconstruction recipe, and 
yield a P{Kext\T>) that has, on average, a width of 0.016 (in- 
ducing a ~1.6% uncertainty on time-delay distances). This 
is 1.5 times less broad than the global P(Koxt). 

• For the most over-dense lines of sight, the reconstruc- 
tion produces a broad PDF; conversely, we find that the 
lines of sight with the sharpest inferred P(Kext|I') are typi- 
cally under-dense. Since the reconstructions described here 
can be performed before follow-up time is invested, it will 
be possible to select targets by the precision of their Koxt 
estimates. This will be useful in an era when there are more 
known lenses than can be foUowed-up. Photometric data 
are sufficient to select the best lines of sight for follow-up. 
Spectroscopic coverage helps to tighten P(Koxt|I') for most 
lines of sight. 

• Selecting from the highest precision 33% of the recon- 
structed lines of sight results in a sample with low bias, 
~0.1% in terms of the time-delay distance. For these lines 



of sight the reconstructed P(Koxt|2?) induces a statistical 
uncertainty of ~1% on individual time-delay distances. 

• Conversely, selecting lines of sight with high shear 
(I7I > 0.05; a somewhat extreme selection, but one that 
could potentially arise from focusing on four-image or wide 
separation lenses) results in a sample with bias of 0.8% in 
time delay distance (and hence Ho). Including realistic es- 
timates of other sources of uncertainty a systematic error 
of 0.8% would become significant in a sample of ~20 lenses. 
The addition of shear constraints to our model might alle- 
viate this potential systematic. 

• Reconstructing a line of sight with an incorrect stellar 
mass-halo mass relation can introduce systematic errors on 
the inferred time-delay distance at the percent level. This 
error can be reduced by tightening the observational limits 
on the stellar mass-halo mass relation at the high mass end. 

While we have tested our reconstruction method 
against realistic simulations of line of sight mass distribu- 
tions in the Universe, our method is also calibrated to these 
simulations. Testing this assumption in the short term, and 
replacing it with an empirical calibration (or hierarchical 
inference) in the longer term, are worthy goals for future 
work. The halo model framework is flexible enough to en- 
able many such improvements, including, quite naturally, 
the incorporation of more information from observations. 
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APPENDIX A: INFERRING Ahoo GIVEN A 
NOISY MEASUREMENT OF A/'^^ 

To estimate the convergence caused by a halo we need to 
know its mass; how can halo mass be inferred from a noisy 
estimate of the stellar mass M*^^^ of a galaxy at redshift z. 
We seek the posterior PDF Pr(M2oo| A^obsi 2), which can be 
expanded as follows: 



© 2012 RAS, MNRAS 000, [THH] 



14 Collett et al. 



Pr(M2oo|Mo*b„2) = 

J dM* Pr(M2ooiM*, 2) Pr(Af*|M:b„ 2), 
oc y dA/* Pr(AC|Af*) Pr(A^*|Af200, z) Pr(A/2ool2()Al) 

where we have used Bayes' Theorem twice to replace 
Pr(M2oo I * , z) Pr ( Af * 1 2) with Pr ( Af * I M200 , z) Pt{M2oo \z), 
and to invert Pr(Af* | Af*ijj,) into the sampling distribution 
Pr(Af *|Af*t,j,), which we recognise as the likelihood func- 
tion for the observed stellar mass. Note that the "true" M* 
of the galaxy is marginalised out: we are only interested 
in inferring the halo mass. The last two terms in ( | Alll are 
the M * — A/200 relation from lBehroozi. Conrov. fc WechsleJ 
(|2010l ). and the halo mass function Pr(Af20o|z), at the given 
redshift. We can tabulate the product of these two from 
our Millennium Simulation catalogue, constructing a two- 
dimensional histogram of halo masses and their associated 
true stellar masses (drawn from the Behroozi relation). 

For each galaxy, we compute the likelihood function 
for its A'f'ijg as a function of the unknown M* , and mul- 
tiply it by our tabulated joint PDF. This heavily down- 
weights halos with M* values outside the observed range. 
We then do the marginalisation integral by Monte Carlo, 
drawing (two-dimensional) sample parameter vectors from 
the down weighted histogram, discarding the M* values, 
and constructing a one-dimensional histogram that is an 
estimate of Pr(Af2oo| Af*ija). 

If the redshift of the galaxy is uncertain, we need to 
take this uncertainty into account; for example, for each 
sample drawn from the photometric redshift posterior PDF 
Pr(2|colors), we can draw a sample A//200 using the above 
procedure. 
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